Using Intelligent Prefetching to Reduce the Energy Consumption of a Large-scale Storage System
Many high performance large-scale storage systems will experience significant workload increases as their user base and content availability grow over time. The U.S. Geological Survey (USGS) Earth Resources Observation and Science (EROS) center hosts one such system, which has recently undergone a period of rapid growth as its user population grew nearly 400% in about three years. When administrators of these massive storage systems face the challenge of meeting the demands of an ever-increasing number of requests, the easiest solution is to integrate more advanced hardware into existing systems. However, additional investment in hardware may significantly increase the system cost as well as daily power consumption. In this paper, we present evidence that well-selected software-level optimization is capable of achieving comparable levels of performance without the cost and power consumption overhead caused by physically expanding the system. Specifically, we develop intelligent prefetching algorithms that are suitable for the unique workloads and user behaviors of the world's largest satellite image distribution system, managed by USGS EROS. Our experimental results, derived from real-world traces with over five million requests sent by users around the globe, show that the EROS hybrid storage system could maintain the same performance with over 30% energy savings by utilizing our proposed prefetching algorithms, compared to the alternative solution of doubling the size of the current FTP server farm.
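The abstract does not spell out the prefetching algorithms themselves, but the general idea of trace-driven prefetching into a fast tier can be sketched with a minimal popularity-based prefetcher. The class name, the cache model, and the scene-ID vocabulary below are hypothetical illustrations, not the paper's actual design:

```python
from collections import Counter

class PopularityPrefetcher:
    """Toy sketch of trace-driven prefetching: keep the most-requested
    items staged in a fast tier so that slower (power-hungry) storage
    is touched less often. Names and policy are illustrative only."""

    def __init__(self, cache_size):
        self.cache_size = cache_size
        self.requests = Counter()   # request frequency from the access trace
        self.cache = set()          # contents of the fast tier

    def record(self, scene_id):
        # Observe one request from the workload trace.
        self.requests[scene_id] += 1

    def prefetch(self):
        # Stage the currently hottest scenes into the fast tier.
        hottest = [s for s, _ in self.requests.most_common(self.cache_size)]
        self.cache = set(hottest)
        return hottest

    def hit(self, scene_id):
        # A hit means the request is served without waking cold storage.
        return scene_id in self.cache
```

Replaying a request trace through `record()` and periodically calling `prefetch()` mimics, in miniature, how a popularity-aware policy converts workload knowledge into fewer accesses to the slow tier.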
HP-DAEMON: High Performance Distributed Adaptive Energy-efficient Matrix-multiplicatiON
The demand for improving the energy efficiency of high performance scientific applications has become critical. Software-controlled hardware solutions based on Dynamic Voltage and Frequency Scaling (DVFS) have been shown to be broadly effective. Although DVFS is beneficial to green computing, DVFS itself can incur non-negligible overhead if it issues a large number of frequency switches. In this paper, we propose a strategy to achieve optimal energy savings for distributed matrix multiplication by algorithmically trading more computation and communication at a time, adaptively and within user-specified memory costs, for fewer DVFS switches; this saves 7.5% more energy on average than a classic strategy. Moreover, we leverage a high performance communication scheme that fully exploits network bandwidth via pipeline broadcast. Overall, the integrated approach achieves substantial energy savings (up to 51.4%) and performance gains (28.6% on average) compared to ScaLAPACK pdgemm() on a cluster with an Ethernet switch, and outperforms ScaLAPACK and DPLASMA pdgemm() by 33.3% and 32.7% on average, respectively, on a cluster with an InfiniBand switch.
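The core trade-off described above (processing more data per step in exchange for fewer DVFS switches) can be illustrated with a toy cost model. The function name and the assumption of a fixed number of switches per step (e.g. scale down for communication, back up for computation) are hypothetical, not taken from the paper:

```python
import math

def dvfs_switches(n, block, switches_per_step=2):
    """Toy cost model: a blocked distributed GEMM over n columns,
    processed `block` columns at a time, runs ceil(n / block) steps.
    If each step issues a fixed number of DVFS frequency switches,
    larger blocks (more memory) mean fewer total switches."""
    steps = math.ceil(n / block)
    return steps * switches_per_step

# Doubling the block size, at the price of larger buffers,
# halves the number of steps and hence the DVFS switch overhead.
```

This is the shape of the trade: memory cost grows with the block size while switch overhead shrinks inversely with it, so an adaptive strategy can pick the largest block the user-specified memory budget allows.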
Network Binarization via Contrastive Learning
Neural network binarization accelerates deep models by quantizing their
weights and activations into 1-bit. However, there is still a huge performance
gap between Binary Neural Networks (BNNs) and their full-precision (FP)
counterparts. As the quantization error caused by weights binarization has been
reduced in earlier works, the activations binarization becomes the major
obstacle for further improvement of the accuracy. BNN characterises a unique
and interesting structure, where the binary and latent FP activations exist in
the same forward pass (i.e., ).
To mitigate the information degradation caused by the binarization operation
from FP to binary activations, we establish a novel contrastive learning
framework while training BNNs through the lens of Mutual Information (MI)
maximization. MI is introduced as the metric to measure the information shared
between binary and FP activations, which assists binarization with contrastive
learning. Specifically, the representation ability of BNNs is greatly
strengthened by pulling together positive pairs of binary and FP activations
from the same input sample, and pushing apart negative pairs from different
samples (the number of negative pairs can be exponentially large). This
benefits the downstream tasks, not only classification but also segmentation
and depth estimation, etc. The experimental results show that our method can be
implemented as a pile-up module on existing state-of-the-art binarization
methods and can remarkably improve the performance over them on CIFAR-10/100
and ImageNet, in addition to great generalization ability on NYUD-v2. Comment: Accepted to ECCV 202
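The pull-positive/push-negative scheme described above is the shape of an InfoNCE-style contrastive objective, which lower-bounds the mutual information between the two views. The following pure-Python sketch is a generic InfoNCE loss over paired binary and FP activation vectors; the function names, temperature value, and cosine-similarity choice are illustrative assumptions, not the paper's exact formulation:

```python
import math

def cosine(u, v):
    # Cosine similarity between two activation vectors.
    num = sum(a * b for a, b in zip(u, v))
    den = (math.sqrt(sum(a * a for a in u)) *
           math.sqrt(sum(b * b for b in v)))
    return num / den

def info_nce(binary_acts, fp_acts, temperature=0.1):
    """InfoNCE-style sketch: each binary activation is pulled toward the
    FP activation of the SAME sample (positive pair) and pushed away
    from the FP activations of OTHER samples (negative pairs), which
    maximizes a lower bound on their mutual information."""
    loss, n = 0.0, len(binary_acts)
    for i in range(n):
        logits = [cosine(binary_acts[i], fp_acts[j]) / temperature
                  for j in range(n)]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        loss += -(logits[i] - log_denom)   # -log softmax at the positive
    return loss / n
```

The loss is smallest when each binary activation is most similar to its own sample's FP activation, which is exactly the alignment the abstract describes.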
Lipschitz Continuity Retained Binary Neural Network
Relying on the premise that the performance of a binary neural network can be
largely restored with eliminated quantization error between full-precision
weight vectors and their corresponding binary vectors, existing works of
network binarization frequently adopt the idea of model robustness to reach
this objective. However, robustness remains an ill-defined concept without
solid theoretical support. In this work, we introduce Lipschitz continuity, a
well-defined functional property, as the rigorous criterion for defining model
robustness in BNNs. We then propose to retain the
Lipschitz continuity as a regularization term to improve the model robustness.
Particularly, while the popular Lipschitz-involved regularization methods often
collapse in BNN due to its extreme sparsity, we design the Retention Matrices
to approximate spectral norms of the targeted weight matrices, which can be
deployed as the approximation for the Lipschitz constant of BNNs without the
exact Lipschitz constant computation (NP-hard). Our experiments prove that our
BNN-specific regularization method can effectively strengthen the robustness of
BNNs (verified on ImageNet-C), achieving state-of-the-art performance on CIFAR
and ImageNet. Comment: Paper accepted to ECCV 202
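The standard way to make a spectral norm (which bounds a linear layer's Lipschitz constant) cheap to penalize is power iteration; the paper's Retention Matrices are a BNN-specific construction on top of this idea that the abstract does not detail. The sketch below shows only the generic power-iteration estimate, as a hypothetical stand-in:

```python
import math

def spectral_norm(mat, iters=50):
    """Power-iteration estimate of the largest singular value of `mat`
    (a list of rows). The spectral norm of a layer's weight matrix
    bounds that layer's Lipschitz constant, so penalizing it is a
    tractable surrogate for the NP-hard exact Lipschitz computation."""
    rows, cols = len(mat), len(mat[0])
    v = [1.0] * cols
    for _ in range(iters):
        # Alternate u = W v and v = W^T u, renormalizing each time.
        u = [sum(mat[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        nu = math.sqrt(sum(x * x for x in u)) or 1.0
        u = [x / nu for x in u]
        v = [sum(mat[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        nv = math.sqrt(sum(x * x for x in v)) or 1.0
        v = [x / nv for x in v]
    # ||W v|| for the converged top right-singular vector v.
    u = [sum(mat[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    return math.sqrt(sum(x * x for x in u))
```

Adding such an estimate (per layer) as a regularization term keeps the product of layer-wise Lipschitz bounds small without ever computing the exact network-level constant.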
Reduce, Reuse, Recycle: Improving Training Efficiency with Distillation
Methods for improving the efficiency of deep network training (i.e. the
resources required to achieve a given level of model quality) are of immediate
benefit to deep learning practitioners. Distillation is typically used to
compress models or improve model quality, but it's unclear if distillation
actually improves training efficiency. Can the quality improvements of
distillation be converted into training speed-ups, or do they simply increase
final model quality with no resource savings? We conducted a series of
experiments to investigate whether and how distillation can be used to
accelerate training using ResNet-50 trained on ImageNet and BERT trained on C4
with a masked language modeling objective and evaluated on GLUE, using common
enterprise hardware (8x NVIDIA A100). We found that distillation can speed up
training by up to 1.96x in ResNet-50 trained on ImageNet and up to 1.42x on
BERT when evaluated on GLUE. Furthermore, distillation for BERT yields optimal
results when it is only performed for the first 20-50% of training. We also
observed that training with distillation is almost always more efficient than
training without distillation, even when using the poorest-quality model as a
teacher, in both ResNet-50 and BERT. Finally, we found that it's possible to
gain the benefit of distilling from an ensemble of teacher models, which has
O(n) runtime cost, by randomly sampling a single teacher from the pool of
teacher models at each step, which has only an O(1) runtime cost. Taken
together, these results show that distillation can substantially improve
training efficiency in both image classification and language modeling, and
that a few simple optimizations to distillation protocols can further enhance
these efficiency improvements.
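The O(n)-to-O(1) ensemble trick described above amounts to sampling one teacher per step rather than querying all of them. A minimal framework-agnostic sketch, with hypothetical names (teachers are modeled as plain callables):

```python
import random

def distillation_targets(ensemble, batch, rng=random):
    """Sketch of the O(1) ensemble-distillation trick: instead of running
    every teacher and averaging their outputs (O(n) forward passes per
    step), sample ONE teacher per step. In expectation over steps, the
    student still sees the ensemble's target distribution."""
    teacher = rng.choice(ensemble)       # one forward pass, O(1) per step
    return [teacher(x) for x in batch]   # soft targets for this batch
```

Over many training steps, each teacher contributes equally often, so the average supervision matches the ensemble's while each individual step costs the same as single-teacher distillation.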
Improving write performance by enhancing internal parallelism of Solid State Drives
Most research on Solid State Drive (SSD) architectures relies on Flash Translation Layer (FTL) algorithms and wear-leveling; however, internal parallelism in SSDs has not been well explored. In this research, we propose a new strategy to improve SSD write performance by enhancing the internal parallelism inside SSDs. An SDRAM buffer is added to the design for buffering and scheduling write requests. Because the same logical block numbers may be translated to different physical numbers at different times by the FTL, the on-board SDRAM buffer is used to buffer requests below the FTL. When the buffer is full, the same amount of data is assigned to each storage package in the SSD to enhance internal parallelism. To accurately evaluate performance, we use both synthetic workloads and real-world applications in our experiments. We compare the enhanced internal parallelism scheme with the traditional LRU strategy, since it would be unfair to compare an SSD with a buffer against an SSD without one. The simulation results demonstrate that the write performance of our design is significantly improved compared with the LRU-cache strategy at the same buffer sizes.
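The flush step described above, where a full buffer is split evenly across flash packages so all channels work concurrently, can be sketched in a few lines. The round-robin assignment and the function name are illustrative assumptions; the abstract only states that each package receives the same amount of data:

```python
def flush_buffer(buffered_writes, num_packages):
    """Sketch of the parallelism-enhancing flush: when the SDRAM buffer
    fills, divide the buffered write requests evenly across the flash
    packages so every channel can program pages in parallel."""
    queues = [[] for _ in range(num_packages)]
    for i, req in enumerate(buffered_writes):
        queues[i % num_packages].append(req)   # round-robin assignment
    return queues
```

Because the FTL may map the same logical block to different physical locations over time, this balancing can only be done below the FTL, which is why the buffer sits at that level in the design.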
Real-time Monitoring for the Next Core-Collapse Supernova in JUNO
A core-collapse supernova (CCSN) is one of the most energetic astrophysical
events in the Universe. The early and prompt detection of neutrinos before
(pre-SN) and during the SN burst is a unique opportunity to realize the
multi-messenger observation of the CCSN events. In this work, we describe the
monitoring concept and present the sensitivity of the system to the pre-SN and
SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), which is
a 20 kton liquid scintillator detector under construction in South China. The
real-time monitoring system is designed with both the prompt monitors on the
electronic board and online monitors at the data acquisition stage, in order to
ensure both the alert speed and alert coverage of progenitor stars. By assuming
a false alert rate of 1 per year, this monitoring system can be sensitive to
pre-SN neutrinos up to a distance of about 1.6 (0.9) kpc and to SN neutrinos
up to about 370 (360) kpc for a progenitor mass of 30 solar masses, for the case
of normal (inverted) mass ordering. The pointing ability for the CCSN is
evaluated by using the accumulated event anisotropy of the inverse beta decay
interactions from pre-SN or SN neutrinos, which, along with the early alert,
can play important roles for the followup multi-messenger observations of the
next Galactic or nearby extragalactic CCSN. Comment: 24 pages, 9 figures
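The pointing method described above, accumulating the anisotropy of inverse-beta-decay events, reduces to averaging many noisy per-event direction vectors. A minimal sketch, with hypothetical names and no detector effects (the real analysis involves the positron-neutron displacement and statistical weighting the abstract does not detail):

```python
import math

def point_to_source(event_dirs):
    """Toy sketch of pointing from accumulated event anisotropy: each
    inverse beta decay carries a slight directional bias along the
    incoming neutrino direction, so the normalized average of many
    per-event direction vectors estimates the supernova direction."""
    n = len(event_dirs)
    # Component-wise mean of the 3D event direction vectors.
    mean = [sum(d[k] for d in event_dirs) / n for k in range(3)]
    norm = math.sqrt(sum(c * c for c in mean)) or 1.0
    return [c / norm for c in mean]
```

The per-event bias is weak, which is why the pointing power comes from accumulation: individual event directions scatter widely, but their mean converges toward the source direction as statistics grow.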